Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC User Agreement for Non-Members
Size:
176000 words
Production Status:
Existing-used
Use:
Discourse
- Paper title: Developing the Bangla RST Discourse Treebank
- Paper track: Infrastructural Issues/Large Projects
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Debopam Das | University of Potsdam | DE |
| Author 2 | Manfred Stede | University of Potsdam | DE |
| Main Contact | Debopam Das | University of Potsdam | None |
Documentation:
Das, D., & Taboada, M. (to appear). RST Signalling Corpus: A corpus of signals of coherence relations. Language Resources & Evaluation.
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
English Standard Arabic
Availability:
Freely Available
License:
Gnu
Size:
120000 tokens
Production Status:
Existing-used
Use:
Annotation and Educational
- Paper title: LAMP: A Multimodal Web Platform for Collaborative Linguistic Analysis
- Paper track: Multimodality
- Paper status: Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Kais Dukes | University of Leeds | None |
| Author 2 | Eric Atwell | University of Leeds | None |
| Main Contact | Kais Dukes | University of Leeds | GB |
Documentation:
http://corpus.quran.com/documentation
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech English
Availability:
From Data Center(s)
License:
LDC (planned)
Size:
1.2 mil <Not Specified>
Production Status:
Newly created-finished
Use:
parsing, parallel parsing, machine translation, coreference resolution, anaphora resolution, natural language generation, lexical acquisition
- Paper title: Announcing Prague Czech-English Dependency Treebank 2.0
- Paper track: Written
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Jan Hajič | Charles University in Prague | None |
| Author 2 | Eva Hajičová | Charles University in Prague | None |
| Author 3 | Jarmila Panevová | Charles University in Prague | None |
| Author 4 | Petr Sgall | Charles University in Prague | None |
| Author 5 | Ondřej Bojar | Charles University in Prague | None |
| Author 6 | Silvie Cinková | Charles University in Prague | None |
| Author 7 | Eva Fučíková | Charles University in Prague | None |
| Author 8 | Marie Mikulová | Charles University in Prague | None |
| Author 9 | Petr Pajas | Charles University in Prague | None |
| Author 10 | Jan Popelka | Charles University in Prague | None |
| Author 11 | Jiří Semecký | Charles University in Prague | None |
| Author 12 | Jana Šindlerová | Charles University in Prague | None |
| Author 13 | Jan Štěpánek | Charles University in Prague | None |
| Author 14 | Josef Toman | Charles University in Prague | None |
| Author 15 | Zdeňka Urešová | Charles University in Prague | None |
| Author 16 | Zdeněk Žabokrtský | Charles University in Prague | None |
| Main Contact | Ondřej Bojar | Charles University in Prague | CZ |
Documentation:
http://ufal.mff.cuni.cz/pcedt2.0/en/documentation.html
Language Type:
Multilingual
Languages:
English
Availability:
Not Available
License:
<Not Specified>
Size:
1255 entries
Production Status:
Newly created-in progress
Use:
Text Normalization
- Paper title: Two Database Resources for Processing Social Media English Text
- Paper track: Written
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Eleanor Clark | Hokkaido University | None |
| Author 2 | Kenji Araki | Hokkaido University | None |
| Main Contact | Eleanor Clark | Hokkaido University | JP |
Documentation:
<Not Specified>
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
3600 entries
Production Status:
Newly created-finished
Use:
Evaluation/Validation
- Paper title: Finding Non-Arbitrary Form-Meaning Systematicity Using String-Metric Learning for Kernel Regression
- Paper track: Empirical/Data-Driven
- Paper status: Accept - Outstanding
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | E. Dario Gutierrez | University of California, San Diego | N/A |
| Author 2 | Roger Levy | Massachusetts Institute of Technology | N/A |
| Author 3 | Benjamin Bergen | University of California, San Diego | N/A |
| Main Contact | E. Dario Gutierrez | University of California, San Diego | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
CreativeCommons
Size:
2 billion words
Production Status:
Existing-used
Use:
Language Modelling
- Paper title: Literal and Metaphorical Senses in Compositional Distributional Semantic Models
- Paper track: Empirical/Data-Driven
- Paper status: Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | E. Dario Gutierrez | University of California, San Diego | N/A |
| Author 2 | Ekaterina Shutova | University of Cambridge | N/A |
| Author 3 | Tyler Marghetis | University of Indiana | N/A |
| Author 4 | Benjamin Bergen | University of California, San Diego | N/A |
| Main Contact | E. Dario Gutierrez | University of California, San Diego | None |
Documentation:
Available in English on website
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Attribution-NonCommercial-ShareAlike 4.0 International
Size:
57.8 <Not Specified>
Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
- Paper title: Annotated Corpus of Scientific Conference's Homepages for Information Extraction
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Piotr Andruszkiewicz | Warsaw University of Technology | PL |
| Author 2 | Rafal Hazan | Institute of Computer Science, Warsaw University of Technology | PL |
| Main Contact | Piotr Andruszkiewicz | Warsaw University of Technology | None |
Documentation:
a README file in tgz file
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
2 KByte
Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: A Unified Annotation Scheme for the Semantic/Pragmatic Components of Definiteness
- Paper track: Written
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Archna Bhatia | Carnegie Mellon University | US |
| Author 2 | Mandy Simons | Carnegie Mellon University | US |
| Author 3 | Lori Levin | Carnegie Mellon University | US |
| Author 4 | Yulia Tsvetkov | Carnegie Mellon University | US |
| Author 5 | Chris Dyer | Carnegie Mellon University | US |
| Author 6 | Jordan Bender | University of Pittsburgh | US |
| Main Contact | Archna Bhatia | Florida Institute for Human and Machine Cognition | None |
Documentation:
<Not Specified>
Written
Web Service,
Language Type:
Trilingual
Languages:
Dutch English German
Availability:
Freely Available
License:
<Not Specified>
Size:
2095000 entries
Production Status:
Newly created-in progress
Use:
Variational Linguistics/Computational Sociolinguistics
- Paper title: Exploring Language Variation Across Europe - A Web-based Tool for Computational Sociolinguistics
- Paper track: Written
- Paper status: Accept Poster+Demo
| Author Number | Name | Affiliation | Country | Additional Affiliation | Additional Country |
|---|---|---|---|---|---|
| Author 1 | Dirk Hovy | Center for Language Technology, University of Copenhagen | DK | Center for Language Technology, University of Copenhagen | IT |
| Author 2 | Anders Johannsen | University of Copenhagen | DK | | |
| Main Contact | Dirk Hovy | Center for Language Technology, University of Copenhagen | None | Bocconi University | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech English German Russian Spanish
Availability:
Evaluation
License:
Unspecified
Size:
56 MByte
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Improving Evaluation of English-Czech MT through Paraphrasing
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Petra Barancikova | Charles University in Prague, Faculty of Mathematics and Physics | CZ |
| Author 2 | Rudolf Rosa | Charles University in Prague, Faculty of Mathematics and Physics | CZ |
| Author 3 | Ales Tamchyna | Charles University in Prague, Faculty of Mathematics and Physics | CZ |
| Main Contact | Petra Barancikova | Charles University in Prague, Faculty of Mathematics and Physics | None |
Documentation:
<Not Specified>




